SPSS Shapiro-Wilk Test - The Ultimate Guide https://www.spss-tutorials.com/spss-shapiro-wilk-test-for-normality/ 


>+ SPSS TUTORIALS 
BASICS ANOVA REGRESSION FACTOR CORRELATION 


SPSS Shapiro-Wilk Test — Quick Tutorial with E> 


By Ruben Geer \@éinapiro-Wilk Test - What is It? 
den Berg | Shapiro-Wilk Test - Null Hypothesis 
under Statistics A-Z & 
Basics ° Running the Shapiro-Wilk Test in SPSS 
—~~~~ ~ e Shapiro-Wilk Test - Interpretation 


e Reporting a Shapiro-Wilk Test in APA style 


Shapiro-Wilk Test - What is It? 


The Shapiro-Wilk test examines if a variable 
is normally distributed in some population. 


Like so, the Shapiro-Wilk serves the exact same purpose as the 
Kolmogorov-Smirnov test. Some statisticians claim the latter is worse due 
to its lower statistical power. Others disagree. 


As an example of a Shapiro-Wilk test, let's say a scientist claims that the 
reaction times of all people -a population- on some task are normally 
distributed. He draws a random sample of N = 233 people and measures 
their reaction times. A histogram of the results is shown below. 


Frequency Distribution Reaction Times Total Sample | N = 233 
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Reaction time trial 1 


This frequency distribution seems somewhat bimodal. Other than that, it 
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looks reasonably -but not exactly- normal. However, sample outcomes 
usually differ from their population counterparts. The big question is: 


how likely is the observed distribution if the reaction times 
are exactly normally distributed in the entire population? 


The Shapiro-Wilk test answers precisely that. 


How Does the Shapiro-Wilk Test Work? 


A technically correct explanation is given on this Wikipedia page. 
However, a simpler -but not technically correct- explanation is this: the 
Shapiro-Wilk test first quantifies the similarity between the observed and 
normal distributions as a single number: it superimposes a normal curve 
over the observed distribution as shown below. It then computes which 
percentage of our sample overlaps with it: a similarity percentage. 


SHAPIRO-WILK NORMALITY TEST 
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@ NORMAL 
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Frequency 
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OBSERVED DISTRIBUTION FOLLOWS THEORETICAL DISTRIBUTION? 


Finally, the Shapiro-Wilk test computes the probability of finding this 
observed -or a smaller- similarity percentage. It does so under the 
assumption that the population distribution is exactly normal: the null 
hypothesis. 


Shapiro-Wilk Test - Null Hypothesis 
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The null hypothesis for the Shapiro-Wilk test is that 
a variable is normally distributed in some population. 


A different way to say the same is that a variable’s values are a simple 
random sample from a normal distribution. As a rule of thumb, we 


reject the null hypothesis if p < 0.05. 


So in this case we conclude that our variable is not normally distributed. 
Why? Well, p is basically the probability of finding our data if the null 
hypothesis is true. If this probability is (very) small -but we found our data 
anyway- then the null hypothesis was probably wrong. 


Shapiro-Wilk Test - SPSS Example Data 


A sample of N = 236 people completed a number of speedtasks. Their 
reaction times are in speedtasks.sav, partly shown below. We'll only use 
the first five trials in variables r01 through r05. 


ta speedtasks.sav [] - IBM SPSS Statistic: 
File Edit View Data Transform Analyze Graphs Utilities Extensions Window Help 


Name — Lued J Label 
1 id a pA P Nc 
2 sex n e SEX {.0 
3 \agegroup ...... Age Group {1 
4 r01 ...... Reaction time trial 1 Nc 
5 r02 .... Reaction time trial 2 Ne 
6 r03 .... . Reaction time trial 3 Ne 
T r04 ..... Reaction time trial 4 Nc 
8 lro5 ...... Reaction time trial 5 INe 


| recommend you always thoroughly inspect all variables you'd like to 
analyze. Since our reaction times in milliseconds are quantitative 
variables, we'll run some quick histograms over them. | prefer doing so 
from the short syntax below. Easier -but slower- methods are covered in 
Creating Histograms in SPSS. 


*Quick histograms with normal curves as data check. 


frequencies r01 to r05 


/format notable 
/histogram normal. 


Results 
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Note that some of the 5 histograms look messed up. Some data seem 
corrupted and had better not be seriously analyzed. An exception is trial 4 
(shown below) which looks plausible -even reasonably normally 
distributed. 


Reaction time trial 4 


Frequency 


0 500 1000 1500 2000 


Reaction time trial 4 


Descriptive Statistics - Skewness & Kurtosis 


If you're reading this to complete some assignment, you're probably 
asked to report some descriptive statistics for some variables. These 
often include the median, standard deviation, skewness and kurtosis. 
Why? Well, for a normal distribution, 


e skewness = 0: it's absolutely symmetrical and 


e kurtosis = 0 too: it's neither peaked (“leptokurtic”) nor flattened 
(“platykurtic”). 


So if we sample many values from such a distribution, the resulting 
variable should have both skewness and kurtosis close to zero. You can 
get such statistics from FREQUENCIES but | prefer using MEANS: it 
results in the best table format and its syntax is short and simple. 


*Descriptives table. 


means r01 to r05 


/cells count mean median stddev skew kurt. 


*Optionally: transpose table (requires SPSS 22 or hig 
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output modify 
/select tables 


/if instances = last /*process last table in output, 
/table transpose = yes. 


Results 

Report 
N Mean Median Std. Deviation Skewness Kurtosis 

Reaction time trial 1 233 969.97 932.00 275.318 341 -.394 
Reaction time trial2 235 476.77 267.00 365.437 1.244 032 
Reaction time trial3 233 1001.57 954.00 341.493 2.211 9.654 
Reaction time trial4 226 980.96 969.00 267.895 373 181 
Reaction time trial5 235 1009.52 970.00 348.779 | 2.625 | 12.439 | 


Trials 2, 3 and 5 all have a huge skewness and/or kurtosis. This suggests 
that they are not normally distributed in the entire population. Skewness 
and kurtosis are closer to zero for trials 1 and 4. 

So now that we've a basic idea what our data look like, let's proceed with 
the actual test. 


Running the Shapiro-Wilk Test in SPSS 


The screenshots below guide you through running a Shapiro-Wilk test 
correctly in SPSS. We'll add the resulting syntax as well. 


& speedtasks.sav [] - IBM SPSS Statistic: 
File Edit View Data Transform “Analyze Graphs Utilities Extensions Window Help 
delete] Reports 


nan Ne 
Compare Means c 
General Linear Model { 
agegroup Correlate {1 
Explore a Explore: Plots 
Dependent List: Boxplot: = 
O01 ss INCLUDES i 
$ o2 ©) Aa SHAPIRO-WILK TEST 
O nr { Options.. | © Dept 
Factor List: | © None fl | 
m ® 
© Ciee 
Label Cases by: 
ta Explore: Options 


Display 
© Both © Statistics © Plots (2) | 


| Missing values = | 
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© (Gems) cone) tte) 


Following these screenshots results in the syntax below. 


*Shapiro-Wilk test pasted from Analyze - Descriptive 


EXAMINE VARIABLES=r01 r02 r03 r04 r05 
/PLOT BOXPLOT NPPLOT 

/COMPARE GROUPS 
/STATISTICS DESCRIPTIVES 
/CINTERVAL 95 


/MISSING PAIRWISE /*IMPORTANT! 
/NOTOTAL. 


Running this syntax creates a bunch of output. However, the one table 
we're looking for -“Tests of Normality”- is shown below. 


Shapiro-Wilk Test - Interpretation 


Tests of Normality 


Shapiro-Wilk (= p 


Statistic df Sig 


Reaction time trial 1 .984 233 


Reaction time trial 2 23 


Reaction time trial 3 : 233 
Reaction time trial 4 ‘ 226 


Reaction time trial 5 d 235 


P < 0.05? REJECT H, OF NORMAL DISTRIBUTION IN POPULATION 


We reject the null hypotheses of normal population distributions 
for trials 1, 2, 3 and 5 at a = 0.05. 


“Sig.” or p is the probability of finding the observed -or a larger- deviation 
from normality in our sample if the distribution is exactly normal in our 
population. If trial 1 is normally distributed in the population, there's a 
mere 0.01 -or 1%- chance of finding these sample data. These values are 
unlikely to have been sampled from a normal distribution. So the 
population distribution probably wasn't normal after all. 


We therefore reject this null hypothesis. Conclusion: trials 1, 2, 3 and 5 
are probably not normally distributed in the population. 
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The only exception is trial 4: if this variable is normally distributed in the 
population, there's a 0.075 -or 7.5%- chance of finding the nonnormality 
observed in our data. That is, there's a reasonable chance that this 
nonnormality is solely due to sampling error. So 


for trial 4, we retain the null hypothesis 
of population normality because p > 0.05. 


We can't tell for sure if the population distribution is normal. But given 
these data, we'll believe it. For now anyway. 


Reporting a Shapiro-Wilk Test in APA style 


For reporting a Shapiro-Wilk test in APA style, we include 3 numbers: 


the test statistic W -mislabeled “Statistic” in SPSS; 


its associated df -short for degrees of freedom and 


its significance level p -labeled “Sig.” in SPSS. 


The screenshot shows how to put these numbers together for trial 1. 


Tests of Normality 


W Shapiro-Wilk P 


Statistic df Sig 
Reaction time trial 1 .984 233 .010 


“A Shapiro-Wilk test showed a significant departure from normality, W(233) = 0.98, p = 0.01.” 


Limited Usefulness of Normality Tests 


The Shapiro-Wilk and Kolmogorov-Smirnov test both examine if a 
variable is normally distributed in some population. But why even bother? 
Well, that's because many statistical tests -including ANOVA, t-tests and 
regression- require the normality assumption: variables must be 
normally distributed in the population. However, 


the normality assumption is only needed for small sample sizes 


of -say- N < 20 or so. For larger sample sizes, the sampling distribution of 
the mean is always normal, regardless how values are distributed in the 
population. This phenomenon is known as the central limit theorem. And 
the consequence is that many test results are unaffected by even severe 
violations of normality. 

So if sample sizes are reasonable, normality tests are often pointless. 
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Sadly, few statistics instructors seem to be aware of this and still bother 
students with such tests. And that's why | wrote this tutorial anyway. 


Hey! But what if sample sizes are small, say N < 20 or so? Well, in that 
case, many tests do require normally distributed variables. However, 


normality tests typically have low power in small sample sizes. 


As a consequence, even substantial deviations from normality may not be 
statistically significant. So when you really need normality, normality tests 
are unlikely to detect that it's actually violated. Which renders them pretty 
useless. 


Thanks for reading. 


Tell us what you think! 
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Done! 


*Required field. Your comment will show up after approval from a moderator. 


THIS TUTORIAL HAS 22 COMMENTS: 


By Chancha terhemba on June 16th, 2022 


Well explained and straightforward. 
Thanks for such a wonder work.. 
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By So on June 18th, 2022 


Thank you very much for your kind explanation in your 
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